Finding Characteristic Substructures for Metabolite Classes
نویسندگان
چکیده
We introduce a method for finding a characteristic substructure for a set of molecular structures. Different from common approaches, such as computing the maximum common subgraph, the resulting substructure does not have to be contained in its exact form in all input molecules. Our approach is part of the identification pipeline for unknown metabolites using fragmentation trees. Searching databases using fragmentation tree alignment results in hit lists containing compounds with large structural similarity to the unknown metabolite. The characteristic substructure of the molecules in the hit list may be a key structural element of the unknown compound and might be used as starting point for structure elucidation. We evaluate our method on different data sets and find that it retrieves essential substructures if the input lists are not too heterogeneous. We apply our method to predict structural elements for five unknown samples from Icelandic poppy. 1998 ACM Subject Classification J.2 Physical Sciences and Engineering (Chemistry)
منابع مشابه
The Study of Substructures of Addiction Phenomena in High School Students Using Problem Finding Workshops
Background: Addiction is one of the complicated problems in Iranian young population. The social and cultural dimensions of this social disease are less considered. So considering socio-cultural and environmental resources, this study investigated the substructures of addiction according to the viewpoints of high-school students of Kerman in 2007-2008.Methods: This qualitative study accomplishe...
متن کاملUnsupervised Discovery and Comparison of Structural Families Across Multiple Samples in Untargeted Metabolomics
In untargeted metabolomics approaches, the inability to structurally annotate relevant features and map them to biochemical pathways is hampering the full exploitation of many metabolomics experiments. Furthermore, variable metabolic content across samples result in sparse feature matrices that are statistically hard to handle. Here, we introduce MS2LDA+ that tackles both above-mentioned proble...
متن کاملCharacteristic Substructures in Sets of Organic Compounds with Similar Infrared Spectra
A method based on the determination of maximum common substructures is applied for the generation of substructures which are characteristic for a given set of molecular structures. The molecular structures are from hitlists obtained by spectral library searches; the hitlists contain those reference compounds, which have infrared spectra most similar to that from the query compound. The influenc...
متن کاملPrediction of metabolic reactions based on atomic and molecular properties of small-molecule compounds
MOTIVATION Our knowledge of the metabolites in cells and their reactions is far from complete as revealed by metabolomic measurements that detect many more small molecules than are documented in metabolic databases. Here, we develop an approach for predicting the reactivity of small-molecule metabolites in enzyme-catalyzed reactions that combines expert knowledge, computational chemistry and ma...
متن کاملRandom molecular substructures as fragment-type descriptors
A novel approach for analysis of structure-activity relationships for sets of active compounds is reported. Molecular similarity relationships are analyzed among compounds with related biological activity based on the evaluation of randomly generated fragment populations. Fragments are randomly generated by iterative bond deletion in molecular graphs and hence depart from those obtained by well...
متن کامل